Supporting User-Defined Functions on Uncertain Data

Authors

  • Thanh T. L. Tran
  • Yanlei Diao
  • Charles A. Sutton
  • Anna Liu
Abstract

Uncertain data management has become crucial in many sensing and scientific applications. As user-defined functions (UDFs) become widely used in these applications, an important task is to capture result uncertainty for queries that evaluate UDFs on uncertain data. In this work, we provide a general framework for supporting UDFs on uncertain data. Specifically, we propose a learning approach based on Gaussian processes (GPs) to compute approximate output distributions of a UDF when evaluated on uncertain input, with guaranteed error bounds. We also devise an online algorithm to compute such output distributions, which employs a suite of optimizations to improve accuracy and performance. Our evaluation using both real-world and synthetic functions shows that our proposed GP approach can outperform the state-of-the-art sampling approach with up to two orders of magnitude improvement for a variety of UDFs.
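As a rough illustration of the idea in the abstract, the following Python sketch fits a Gaussian process surrogate to a small number of UDF calls and then pushes samples of an uncertain input through the cheap surrogate instead of the UDF itself. Everything in it is an assumption made for illustration: the toy `udf`, the RBF kernel and its hyperparameters, the training grid, and all function names are hypothetical. It does not reproduce the paper's error bounds, online algorithm, or optimizations.

```python
import math
import random

def udf(x):
    """Hypothetical stand-in for an expensive user-defined function."""
    return math.sin(x) + 0.5 * x

def rbf(a, b, length_scale=1.0):
    """Squared-exponential kernel between two scalars (assumed hyperparameter)."""
    return math.exp(-0.5 * ((a - b) / length_scale) ** 2)

def solve_spd(matrix, rhs):
    """Solve matrix @ x = rhs for a symmetric positive-definite matrix (Cholesky)."""
    n = len(rhs)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = matrix[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            low[i][j] = math.sqrt(s) if i == j else s / low[j][j]
    # Forward substitution: low @ z = rhs
    z = [0.0] * n
    for i in range(n):
        z[i] = (rhs[i] - sum(low[i][k] * z[k] for k in range(i))) / low[i][i]
    # Back substitution: low^T @ x = z
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (z[i] - sum(low[k][i] * x[k] for k in range(i + 1, n))) / low[i][i]
    return x

def fit_surrogate(mu, sigma, n_train=25, jitter=1e-6):
    """Fit a GP posterior mean to the UDF on a grid spanning mu +/- 5 sigma."""
    lo, hi = mu - 5 * sigma, mu + 5 * sigma
    xs = [lo + (hi - lo) * i / (n_train - 1) for i in range(n_train)]
    ys = [udf(x) for x in xs]  # the only calls to the expensive UDF
    gram = [[rbf(a, b) + (jitter if i == j else 0.0)
             for j, b in enumerate(xs)] for i, a in enumerate(xs)]
    alpha = solve_spd(gram, ys)

    def surrogate(x):
        # GP posterior mean at x (zero prior mean assumed)
        return sum(w * rbf(x, t) for w, t in zip(alpha, xs))

    return surrogate

def approximate_outputs(mu, sigma, n_samples=2000, seed=0):
    """Sample a Gaussian input and evaluate the surrogate on each sample."""
    rng = random.Random(seed)
    surrogate = fit_surrogate(mu, sigma)
    inputs = [rng.gauss(mu, sigma) for _ in range(n_samples)]
    return inputs, [surrogate(x) for x in inputs]

if __name__ == "__main__":
    xs, ys = approximate_outputs(mu=1.0, sigma=0.5)
    worst = max(abs(y - udf(x)) for x, y in zip(xs, ys))
    print("largest surrogate error on the samples:", worst)
```

In this sketch the UDF is called only `n_train` times, whereas direct Monte Carlo sampling would call it once per input sample. The price is surrogate error, which the sketch measures after the fact but does not bound in advance.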

Similar resources

Managing Continuous Uncertain Data by a Probabilistic XML Database Management System

Database systems are widely used in today’s world. Almost every information system contains one or more databases. From a traditional perspective, databases are used to store precise values about objects in the ’real world’. However, much information is uncertain or imprecise. Consider, for example, sensor applications. Sensors produce uncertain and imprecise data since readings of sensors are ...

Method of Collaborative Filtering Based on Uncertain User Interests Cluster

Recommender systems have proven to be a valuable means for online users to cope with information overload and have become one of the most powerful and popular tools in electronic commerce. The suggestions provided are aimed at supporting their users in various decision-making processes, such as what items to buy, what music to listen to, or what news to read. In the paper, we introduce ...

Valuing the Attributes of Entities of the FRBR Conceptual Model from the Viewpoint of Computer Catalog Users

Purpose: The aim is to investigate the views of three groups of library users (non-professionals, specialized professionals, librarians) regarding the importance and value of the attributes of entities of the FRBR conceptual model in supporting user tasks. Methodology: All attributes of entities of the FRBR conceptual model in supporting user tasks were examined and evaluated through a descriptive survey of th...

Access and Mobility Policy Control at the Network Edge

The fifth generation (5G) system architecture is defined as service-based, and the core network functions are described as sets of services accessible through application programming interfaces (APIs). One of the components of 5G is Multi-access Edge Computing (MEC), which provides open access to radio network functions through APIs. Using the mobile edge API, third-party analytics applications ...

ORION: Managing Uncertain (Sensor) Data

An important quality of sensor data is that it is often uncertain or imprecise. This uncertainty can be an inherent aspect of the data (e.g. due to known errors in the measuring device, such as the Gaussian error in GPS readings), or it may be introduced in order to achieve scalability [2, 1], or to ensure a certain level of privacy [4]. Existing database management systems provide virtually no...

Journal:
  • PVLDB

Volume 6

Publication date: 2013